Multi-lingual neural title generation for e-Commerce browse pages
نویسندگان
چکیده
To provide better access of the inventory to buyers and better search engine optimization, e-Commerce websites are automatically generating millions of easily searchable browse pages. A browse page consists of a set of slot name/value pairs within a given category, grouping multiple items which share some characteristics. These browse pages require a title describing the content of the page. Since the number of browse pages are huge, manual creation of these titles is infeasible. Previous statistical and neural approaches depend heavily on the availability of large amounts of data in a language. In this research, we apply sequence-to-sequence models to generate titles for high& low-resourced languages by leveraging transfer learning. We train these models on multi-lingual data, thereby creating one joint model which can generate titles in various different languages. Performance of the title generation system is evaluated on three different languages; English, German, and French, with a particular focus on lowresourced French language.
منابع مشابه
Generating titles for millions of browse pages on an e-Commerce site
We present three approaches to generate titles for browse pages in five different languages, namely English, German, French, Italian and Spanish. These browse pages are structured search pages in an e-commerce domain. We first present a rule-based approach to generate these browse page titles. In addition, we also present a hybrid approach which uses a phrase-based statistical machine translati...
متن کاملA Multi-task Learning Approach for Improving Product Title Compression with User Search Log Data
It is a challenging and practical research problem to obtain effective compression of lengthy product titles for Ecommerce. This is particularly important as more and more users browse mobile E-commerce apps and more merchants make the original product titles redundant and lengthy for Search Engine Optimization. Traditional text summarization approaches often require a large amount of preproces...
متن کاملClassifying Web pages employing a probabilistic neural network
This paper proposes a system capable of identifying and categorising web pages, on the basis of information filtering. The system is a three layer Probabilistic Neural Network (PNN) with biases and radial basis neurons in the middle layer and competitive neurons in the output layer. The domain of study involves the e-commerce area. Thus, the PNN scopes to identify e-commerce web pages and class...
متن کاملLearning to Describe E-Commerce Images from Noisy Online Data
Recent study shows successful results in generating a proper language description for the given image, where the focus is on detecting and describing the contextual relationship in the image, such as the kind of object, relationship between two objects, or the action. In this paper, we turn our attention to more subjective components of descriptions that contain rich expressions to modify objec...
متن کاملAn Intelligent Information System for Detecting Web Commerce Transactions
This paper proposes an algorithm for detecting web transactions through web page classification. The algorithm is implemented over a generalised regression neural network and detects e-commerce pages classifying them to the respective transaction phase according to a framework, which describes the fundamental phases of commercial transactions in the web. Many types of web pages were used in ord...
متن کامل